Reaction kernels: predicting enzyme functions you have never seen before
نویسندگان
چکیده
Motivation: Enzyme function prediction is an important problem in post-genomic bioinformatics. There are two general methods for solving the problem: annotation transfer from a similar annotated protein, and machine learning approaches that treat the problem as classification against a fixed taxonomy, such as Gene Ontology or the EC hierarchy. These methods are suitable in cases where the function of the new protein is indeed previously characterized and included in the taxonomy. However, given a new function that is not previously described, these approaches arguably do not offer adequate support for the human expert. Results: In this paper, we explore a structured output learning approach, where enzyme function—an enzymatic reaction—is described in fine-grained fashion with so called reaction kernels which allow interpolation and extrapolation in the output (reaction) space. A structured output model is learned via Maximum Margin Regression to predict enzymatic reactions from sequence motifs. We bring forward several choices for constructing reaction kernels and experiment with them in the remote homology case where the functions in the test set have not been seen in the training phase. Our experiments demonstrate the viabilty of our approach. Availability: Software and datasets will be made available by the time of the conference. Computing Reviews (1998)
منابع مشابه
Reaction Kernels - Structured Output Prediction Approaches for Novel Enzyme Function
Abstract: Enzyme function prediction problem is usually solved using annotation transfer methods. These methods are suitable in cases where the function of the new protein is previously characterized and included in the taxonomy such as EC hierarchy. However, given a new function that is not previously described, these approaches arguably do not offer adequate support for the human expert. In t...
متن کاملTowards structured output prediction of enzyme function
BACKGROUND In this paper we describe work in progress in developing kernel methods for enzyme function prediction. Our focus is in developing so called structured output prediction methods, where the enzymatic reaction is the combinatorial target object for prediction. We compared two structured output prediction methods, the Hierarchical Max-Margin Markov algorithm (HM3) and the Maximum Margin...
متن کاملThe secret world of silent PPOs.
Have you ever been handed an EOB that looked . . . well . . . strange? Maybe it had some unfamiliar corporate name and logo you'd never seen before. Then, as you read down, did you end up scratching your head, wondering why your original charges had been discounted so heavily, and for a patient who didn't seem to be in any network you belonged to? If so, you may have been the unwitting victim o...
متن کاملجستاری میان رشته ای در اصول طراحی اتاق خواب با استناد به آموزه های نَقلی مکتب اسلام
The Holy Quran and hadiths as the most important Islamic religious texts have significant capacities in various fields of human knowledge which have been often neglected. Among the issues raised in this religious texts are some of architectural problems. One of these problems is quality of bedroom design. In this article, this problem will be probed by citing verses and hadiths to deduct bedroo...
متن کاملThe basics of frame theory
This is the first part of a series of notes [3, 4, 5, 6, 7] on frames, with several more to come. In this part I set down the basic definitions and properties, give a few examples, and begin the discussion of nuclei. The last section looks at various categorical properties, and could be omitted at a first reading. 1 After that you should read [4] and [5] in the order you prefer. That will give ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009